Optical character recognition technique for protected viewing of digital files

ABSTRACT

Unlocking digital content embodied in digital readable form on a digital media carrier includes receiving a scanned image of a page from scanning a physical copy of content, evaluating the scanned image; and if the scanned image corresponds to a selected page of the digital content, unlocking the digital content.

BACKGROUND

This invention relates to techniques for unlocking digital content.

Some publishers publish books and other content on computer readable media such as compact disks (CD's). Other types and formats of digital media may be used to publish a book or other type of printed publication in digital form, such as a digital video disk, magnetic disk and the like. Generally, when content such as a book is published in a digital format, it is published in a bitmap or PDF type format.

Publishers are understandably concerned about publishing content in digital format, in general, and accessible digital format in particular, because material published in digital format such as on a CD is easily illegally copied and distributed. In digital format, publishers risk losing control of a copyrighted work. Some publishers avoid making some or all of their content available in digital format, or protect the content to make the content difficult to copy and to access. Some schemes use encryption that requires a key for instance to unlock the CD to allow a user to retrieve text representation of the content from the CD.

Accordingly, publishers might want to protect content to prevent users from viewing the content, duplicating the content exactly (e.g., by duplicating the entire CD), printing the content (e.g., printing a copy of the book from the CD), and extracting content, such as text or images, and then using the extracted content by reformatting it, translating it, or incorporating it into some other form, such as for searching or indexing.

PDF format has an elaborate set of protections that address some of these protection issues. For example, in PDF, printing can be locked, as can be copying of text, or passwords can be used to lock and unlock copying and printing. In some applications, PDF uses “back door” methods that allow screen readers for the blind to access text, via secret passwords that publishers share with the manufacturers of the screen readers.

A reading machine needs text in a digital form to read content aloud for a blind person or a person with learning disabilities. Some reading machines use scanners and optical character recognition (OCR) software to input pages as text into the reading machine.

SUMMARY

The current state of acceptance of publishing printed content in digital form is an obstacle for blind users and other users with learning disabilities. While some reading machines use scanners and optical character recognition (OCR) software to input pages as text into the reading machine, such an approach requires a scanner and relatively expensive OCR software. Moreover, OCR processing is relatively slow.

Publishers do not necessarily want to publish content in digital format. Even for CD's storing digital content in an image format, e.g., PDF format, the CD is still easily copied. PDF is equivalent to bitmap format, though its content can be optionally protected in various ways. Thus, if a publisher makes a bit-mapped or PDF version of the book available on CD, the book can be viewed on the screen. While in PDF format a user cannot extract a text file unless the user also has OCR software, the entire CD can nevertheless be copied. Thus, the book embodied in digital content on a CD can be copied relatively inexpensively, whereas the physical book is much harder to copy, and copying of an entire book is less practical.

Even when publishers publish books and the like on CD's, publishers are reluctant to distribute the CD's, because users may copy the CD and avoid buying a physical copy of the book. Providing content in digital formats such as CD's is desirable, because in some ways CD's are more accessible than a physical copy of a book. CD's are searchable and CD's can store additional content features. However, what restrains greater acceptance of publishing books on CD's is that there is no easy way to prevent copying of the CD's.

According to an aspect of the present invention, a method for unlocking digital content embodied in digital readable form on a digital media carrier includes receiving a scanned image of a page from scanning a physical copy of a printed publication associated with the digital content, evaluating the scanned image, and if the scanned image corresponds to a selected page of the printed publication, unlocking the digital content.

The following are embodiments within the scope of the claims. Evaluating further includes applying optical character recognition to the scanned image to produce a text representation of the scanned image and comparing the text representation to a stored version of the text to determine if the scanned image corresponds to the selected page of the digital content. Evaluating includes applying optical character recognition to the scanned image to produce data that represents an arrangement of text in the scanned image and comparing the data representing the arrangement of text to a stored version of the arrangement to determine if the scanned image file corresponds to the selected page of the digital content. The page is selected at random. A digital camera or a web cam is used to scan the page and evaluating includes detecting a pattern of color in the data from the digital camera or web cam. A scanner is used to scan the page. The digital content is unlocked to provide a text based searchable file. Applying optical character recognition generates a signature that is compared to a signature generated from the stored file to determine whether the user has the book. The digital content is a book stored on a digital storage media and the physical copy of the digital media is a printed copy of the book. The digital content is unlocked to provide text. The method includes applying speech synthesis to convert the text from the digital content into speech that is read aloud to the user.

The method includes displaying a representation of a portion of the printed copy of the digital media on a display, applying a highlighting indicia to an item in the displayed portion of the printed content, in synchronization with reading the item to the user. The digital content includes a book stored in a digital representation on the digital storage media and the physical copy of the digital media is a printed copy of the book. Displaying, displays an image representation from the digital media, and the method further includes applying speech synthesis to convert the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the image representation in synchronization with reading the text. The method includes displaying a representation of a portion of the digital media on a display, using speech synthesis to convert text corresponding to the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the displayed portion in synchronization with text corresponding to the item being read.

According to a further aspect of the present invention, a computer program product for unlocking digital content embodied in digital readable form on a digital media carrier includes instructions to receive a scanned image of a page from scanning a physical copy of content, evaluate the scanned image; and if the scanned image corresponds to a selected page of the digital content, unlock the digital content.

According to a further aspect of the present invention, an apparatus includes a computing processing device and a computer readable medium storing a computer program product for unlocking digital content embodied in digital readable form on a digital media carrier, the computer program product including instructions to receive a scanned image of a page from scanning a physical copy of content, evaluate the scanned image; and if the scanned image corresponds to a selected page of the digital content, unlock the digital content.

One or more aspects of the invention may provide one or more of the following advantages.

In many configurations of computers, such as in reading machine applications, it is common to have a scanner available. The invention uses a scanned image to unlock the digital content on a digital media carrier. Optical Character Recognition (OCR) processes the image file. By applying one of several processing types the invention determines if there is a sufficient match between a heuristic processed from the scanned page and a stored heuristic. Processing can involve full recognition of text, or generating a signature using less than full OCR processing to determine that the scanned image page was sufficiently correct. Correspondence between the heuristic representing the scanned image and the stored version expected need only be close enough to assure that the user has the correct page and physical possession of the book.

The unlocked digital content can include a text file, a searchable, indexed text file, and special or enhanced features such as an animated presentation using a media player. Unlocking digital content can also include unlocking audio content on a web site. For example, the website can make an audio book available to users who have the physical book. The printed, physical copy of the book is relatively expensive and time consuming to copy. Availability of the physical book is easier to control than the availability of digital content. Thus, using the printed hard to copy book to unlock an easy to copy digital data such as a CD permits the publisher of the media to control more effectively the digital content, while providing enhanced features to those having physical possession of the book.

The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims.

DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram view of a reading system.

FIG. 2 is a flow chart depicting a technique for unlocking digital content embodied on a computer readable medium.

FIGS. 3-5 are flow charts of techniques useful in the unlocking technique of FIG. 2.

FIG. 6 is a block diagram of a network based system.

FIG. 7 is a flow chart of a network based unlocking technique.

FIG. 8 is a flow chart of a tag based unlocking technique.

FIG. 9 is a diagram depicting details of a reading machine configuration for the devices in FIG. 1 or FIG. 6.

DETAILED DESCRIPTION

Referring now to FIG. 1, a computing device 10, such as a personal computer, a personal digital assistant, a specialized computing device or a reading machine and so forth, includes a computer system 12. The computer system 12 is generally a personal computer or can alternatively be another type of computing machine and typically includes a central processor unit (CPU) 14. In addition to the CPU 14, the system includes main memory, cache memory and bus interface circuits (not shown). The computer system 12 includes a mass storage element 16, here typically the hard drive associated with personal computer systems.

The reading system 10 further includes a keyboard 18, a monitor 20, as well as speakers 22, a sound card (not shown), a pointing device such as a mouse (not shown) and a scanner 24, all coupled to various ports of the computer system 12 via appropriate interfaces and software drivers (not shown). The computer system 12 here operates under a Windows NT® Microsoft Corporation operating system. Alternatively, other operating systems could be used. The computer system 12 also includes a digital media carrier 17 that is read by the computer system 12 using a drive, e.g., a CD or CD/DVD drive, etc. (not shown).

The digital media carrier 17, such as a compact disk (CD), a web site (FIG. 6), etc., includes digital media unlocking software 30 and locked digital content 31. The digital media unlocking software 30 will be described as residing on the CD and will be loaded into the computer system 12 for execution; however, other arrangements are possible, such as digital media unlocking software 30 being part of another application, or an operating system and so forth.

The digital media unlocking software 30 controls access to the locked digital content 31 stored on the digital media carrier 17. In one example, a CD includes a book or other type of printed publication on the CD as the locked digital content 31, and a user uses the digital media unlocking software 30 to unlock the locked digital content 31. Various levels of access can be provided by unlocking the CD. Unlocking the CD can include gaining access to a bitmap image of the contents, a text version of the contents, searching features, special enhancements, and so forth. Other uses are for unlocking digital content on the Internet, as described in FIGS. 6 and 7. Thus, in general the digital media unlocking software 30 unlocks the locked digital content whether the locked digital content 31 is locally stored such as on a CD or remotely stored such as on a web site on the Internet.

The unlocking software 30 requires that the user have possession of a physical copy of, or corresponding to, the content stored on the CD. In one example, the locked digital content 31 includes a book, and the unlocking software 30 requires that the user has physical possession of the book. It is presumed that if the user has a physical copy of the book, then the user bought the book and it is likely that the CD is not an unauthorized copy of the CD. That is, the approach presumes that whoever is reading the physical book owns the book, and is therefore likely authorized to be reading that copy of the book on the CD.

Referring to FIG. 2, a process 30 for unlocking digital content from a CD is shown. Initially, the process can start in a variety of ways. For example, a CD can be placed in a CD or DVD/CD drive and the like and an “auto play” feature of the CD drive/operating system can attempt to open a file on the CD. Alternatively, a user can attempt to open digital content on CD by selecting a file on the CD with a cursor and mouse, and so forth.

Upon receiving 42 an input to open a file with locked digital content, the process 30 selects 44 a test and/or a test type to send to the user. The process 30 generates 46 the test from the selected test type and, in general, displays a user interface screen including the test and a field for the user to enter an answer to the test or the like. The user answers the test posed by process 30 and the process 30 evaluates 48 the answer supplied by the user to see if the answer was correct. Different embodiments 47 a-47 d of tests and evaluations 47 are described in FIGS. 3-5, and 8 below.

If the test was answered correctly, the process 30 unlocks 50 the digital content, e.g., by decrypting the file with the digital content or supplying a password to open the file. The process 30 renders 52 the unlocked digital content. For example, the unlocked digital content can provide a text file, a text based searchable file, active content and so forth. If the test was not answered correctly, an error message can be generated 54 asking the user to try again, and the number of times that the error message is generated can be tracked, so that if the number of errors exceeds a threshold, the software 30 can become disabled.
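The flow of FIG. 2 might be sketched as follows. This is a minimal illustration in Python, not the implementation described here; the names `Challenge`, `Unlocker`, and `max_failures` are hypothetical, and the default threshold of three failures is an assumed value.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Challenge:
    prompt: str                   # text shown to the user
    check: Callable[[str], bool]  # returns True when the answer is correct


class Unlocker:
    """Poses a fresh challenge per attempt and disables itself after too many failures."""

    def __init__(self, make_challenge: Callable[[], Challenge], max_failures: int = 3):
        self.make_challenge = make_challenge
        self.max_failures = max_failures  # assumed threshold, not from the text
        self.failures = 0
        self.disabled = False

    def attempt(self, ask: Callable[[str], str]) -> bool:
        """Return True when the caller may unlock and render the content."""
        if self.disabled:
            return False
        challenge = self.make_challenge()  # select and generate a test
        answer = ask(challenge.prompt)     # display the test, collect the answer
        if challenge.check(answer):        # evaluate the answer
            return True
        self.failures += 1                 # track the error
        if self.failures > self.max_failures:
            self.disabled = True
        return False
```

The decryption or password step itself is left to the caller, since the text permits either.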

The tests that the process 30 selects from are based on testing whether the user has physical possession of the book. The process 30 selects from a number of tests. In some implementations, the tests are selected at random whereas in others the tests are selected based on the type of user or the use to which the user desires to put the digital content. In other arrangements, the process 30 does not select a test type, but rather generates a specific test from a test type predefined for the process 30. Considerations for selecting the test are to make the test reasonably difficult to defeat and different each time to avoid cheating, but not so difficult such that intended users would not be able to figure out how to use the unlocking process 30.

Referring to FIG. 3, a test type 47 a involves a user with a reading machine, e.g., a personal computer or the like including reading software that receives a digital file containing text, converts the digital file to synthesized speech and reads the contents aloud to the user. In reading machine applications (discussed in FIG. 9 below in greater detail), it is common to have a scanner available with the reading machine. The test type 47 a, which uses a scanned image, reads the CD and, before proceeding to display the digital content or make a text version of the digital content available, asks 62 the user to scan a particular page of the physical copy of the book. The user places the selected page on a scanner and scans the page. The scanner produces an image file that is received 64 by the process 47 a. The image file corresponds to the scanned page. The test type 47 a converts 66 the image file using Optical Character Recognition (OCR) processing and determines 68 whether the received image file sufficiently matches the correct page or not.

If the process determines that the user has the physical copy of the book, the test type 47 a unlocks 50 (FIG. 2) the CD and makes the digital content available. The process 47 a can proceed to allow the user to use the digital content for a period of time and, after the period of time has elapsed, run the check again using a different page to make sure the user still has the book.

One way that the test type 47 a determines if there is a sufficient match is to produce a text file of the scanned page that is compared to a stored version of the text file on the CD to determine if the user has the book. Alternatively, the OCR can generate a signature that is compared to a signature generated from the stored file to determine whether the user has the book. Another way to determine if there is a sufficient match is to examine the content using OCR to recognize text on the page and use some heuristic to see if the recognized text matches sufficiently to the text stored in the file on the CD.
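One possible heuristic for a "sufficient match" between recognized text and stored text is a word-level similarity ratio. The sketch below assumes the OCR output is already available as a string; the function names and the 0.8 threshold are illustrative assumptions, and Python's standard `difflib.SequenceMatcher` stands in for whatever comparison a real implementation would use.

```python
import re
from difflib import SequenceMatcher
from typing import List


def normalize(text: str) -> List[str]:
    """Lowercase the text and split it into words, dropping punctuation."""
    return re.findall(r"[a-z0-9]+", text.lower())


def page_matches(ocr_text: str, stored_text: str, threshold: float = 0.8) -> bool:
    """Return True when the OCR words are close enough to the stored page.

    The threshold is an assumed value; it tolerates a few misrecognized
    words while rejecting text from a different page.
    """
    matcher = SequenceMatcher(None, normalize(ocr_text), normalize(stored_text),
                              autojunk=False)
    return matcher.ratio() >= threshold
```

Comparing word lists rather than raw characters keeps a single misread word from affecting its neighbors, so ordinary OCR noise lowers the score only slightly.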

Another way to determine if there is a sufficient match would be to use less than a full OCR process to determine that the image page was sufficiently correct. With less than a full OCR processing, the test type 47 a need not determine whether text was recognized correctly, but merely whether the image file has a page arrangement, e.g., produced locations of probable text, that correspond to a stored version of the expected page arrangement, e.g., regions of probable text. Different gradations can be used. For instance, less than a full OCR processing can merely determine the number of lines of text, probable regions of text vs. images, arrangement of regions of text on the page, and so forth. Correspondence between the image and the stored version expected need only be close enough to assure that the user has the correct page and physical possession of the book.
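A comparison of page arrangement, rather than recognized text, could look like the following sketch. It assumes some layout analysis step (not shown) has already produced bounding boxes for probable text lines, expressed as fractions of the page size; the `Box` type, the tolerance, and the required fraction of matching regions are all hypothetical choices.

```python
from typing import List, Tuple

# (left, top, width, height), each as a fraction of the page dimensions
Box = Tuple[float, float, float, float]


def layout_matches(scanned: List[Box], stored: List[Box],
                   tol: float = 0.02, min_fraction: float = 0.8) -> bool:
    """Return True when the scanned regions are arranged like the stored ones."""
    if not stored:
        return False
    # Cheap first check: the number of text lines should be about the same.
    if abs(len(scanned) - len(stored)) > max(1, len(stored) // 10):
        return False
    unused = list(scanned)
    hits = 0
    for expected in stored:
        for i, candidate in enumerate(unused):
            # A region matches when every coordinate is within the tolerance.
            if all(abs(a - b) <= tol for a, b in zip(expected, candidate)):
                hits += 1
                del unused[i]  # each scanned region may match only once
                break
    return hits / len(stored) >= min_fraction
```

Because only positions are compared, the scan does not need to be sharp enough for character recognition.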

The OCR need not be of high enough quality to recognize text, but rather need only show that the text is more or less arranged in the manner expected on that page. This does not require a high quality scanner. As an alternative to a scanner, a user could use a digital camera or a web cam. The unlocking could be based on detecting a certain pattern of color. The color is in some sense easier to detect, but is somewhat more difficult to copy.

Various approaches can be used to lock the digital content on the CD. For example, the digital content can be only partially locked so that a user can put the CD in the drive and the CD will generate thumbnails of the pages for display on the user's monitor. When the user wants to access one of the pages, the user would have to unlock the page using the unlocking software. The thumbnails provide limited access to allow the user to verify that the user has the correct CD.

Referring to FIG. 4, an alternate test type process 47 b is applicable to a more general case where a publisher wants to allow users to unlock the digital content 31 of the file based on supplying content from the physical book. This test type 47 b need not have a scanner. Test type 47 b generates the test by randomly selecting 72 a page and a word position on the selected page. Test type 47 b stores 74 the word and page. Test type 47 b sends 76 a user interface instructing the user to open the book, turn to the specific, selected page, and type or otherwise input into the computer the word that appears in the selected word position on that page. Test type 47 b will receive the answer and check 78 to see if the answer was correct, thus providing or denying access.

The test type 47 b randomly generates 72 the selected page and word position each time the test type 47 b is executed. Alternatively, the test type 47 b can store a list of, e.g., 50 combinations of page numbers and words at selected word positions or the same word position, e.g., the first word on the page. The test type 47 b would generate a test based on one of these combinations each time the test type 47 b is executed. In addition to a word, phrases, sentences, and so forth, could also be used.
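A word-position test of this kind could be generated roughly as shown below. The sketch assumes the stored book is available as a mapping from page number to a list of words; `make_word_test` and `check_word` are hypothetical names, and the random source is passed in so that a system source such as `random.SystemRandom()` can be used in practice.

```python
import random
from typing import Dict, List, Tuple


def make_word_test(pages: Dict[int, List[str]],
                   rng: random.Random) -> Tuple[int, int, str]:
    """Pick a random page and word position; return (page, 1-based position, word)."""
    candidates = sorted(p for p in pages if pages[p])  # skip pages with no words
    page = rng.choice(candidates)
    index = rng.randrange(len(pages[page]))
    return page, index + 1, pages[page][index]


def check_word(expected: str, answer: str) -> bool:
    """Compare the user's answer to the stored word, ignoring case and outer spaces."""
    return answer.strip().lower() == expected.strip().lower()
```

The returned word would be stored by the process and compared against the user's input after the prompt, e.g., "Turn to page 2 and type word 3."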

Referring to FIG. 5, an alternate test type 47 c includes a printed number on the bottom of the page generated by an algorithm and printed 82 during printing of the physical copy of the book. Each page of the physical copy of the book would have an apparently random number at, e.g., the bottom of the page. For example, the number can be computable based on the page number and a pseudorandom number generator using the page number as the seed. For example, the algorithm encrypts the page number, multiplies the page number by a key and hashes page number*key, or uses any other technique to obscure the page number, so long as the number can be regenerated by the unlocking process 30.

When a user desires to obtain access to the content on the CD, the test type 47 c instructs 84 the user to open to a page and supply the number at the bottom of the page to the process. The test type 47 c uses the algorithm to reproduce 86 the number for verification 88 against the number entered by the user.

One of the features of test type 47 c is that the number does not need to be printed on the CD version of the book. Test type 47 c requires the user to have physical possession of the entire book to unlock the digital content on the CD consistently. Merely using words on the page allows a user to see the words in the CD version once the user has unlocked the book. Thus, if the same word position is always used, the user could write down, e.g., the first word on each page and thus unlock the digital content without necessarily always having physical possession of the book.

Many methods can be used to secure the number at the bottom of the page. For example, the page number can be run through an e.g., 128-bit encryption algorithm, producing a relatively large encrypted copy of the page number. That encrypted page number can be truncated (hashed) with a function that selects several of the digits, e.g., the last three digits from the number to produce the number that is printed on the bottom of the page. Another method that is simple to use would be to use a hash code to obscure the number.
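As one way to picture a regenerable page number, the sketch below derives a three-digit code from the page number and a secret key. It substitutes a keyed hash (HMAC-SHA256 from Python's standard library) for the encrypt-then-truncate scheme mentioned above; that substitution, the function names, and the example key are assumptions made for illustration.

```python
import hashlib
import hmac


def page_code(page_number: int, key: bytes, digits: int = 3) -> str:
    """Derive the short number printed at the bottom of a page."""
    mac = hmac.new(key, str(page_number).encode("ascii"), hashlib.sha256).digest()
    value = int.from_bytes(mac, "big") % (10 ** digits)  # keep the last few digits
    return f"{value:0{digits}d}"                         # zero-padded, e.g. "007"


def verify_page_code(page_number: int, entered: str, key: bytes) -> bool:
    """Regenerate the code for the page and compare it to the user's entry."""
    expected = page_code(page_number, key).encode("ascii")
    return hmac.compare_digest(expected, entered.strip().encode("utf-8"))
```

The same key would be used when printing the book and by the unlocking process. With only three digits, a guess succeeds one time in a thousand, so a scheme like this would rely on limiting the number of failed attempts.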

Another variation that is relatively easy to implement (but also relatively easy to defeat) is to require the user to enter the ISBN number that is printed on nearly every book that is published. Without the physical book in hand, a user would not know the number. However, this implementation is much easier to cheat, since all that is needed is to write the number down somewhere and save it for later use. An alternative is to have the user enter some item that appears on the cover of the book, since the cover will generally not be included in the digital content on the CD. Another alternative is to require the user to scan the bar code.

While printing a number on the bottom of the page is a relatively strong technique since it would require an unauthorized user to copy the entire book, publishers would have to incorporate the number in the printing process, which they might be reluctant to do. Using the ISBN number does not have this disadvantage since the number is already being printed. Another technique would be to imprint a single unique number on each physical copy of the book and produce CD's that are tied to that particular number.

Referring to FIG. 6, another arrangement 110 for unlocking digital content is with the Internet 118. FIG. 6 shows a client system 112 that accesses a website hosted on a web server 114 (or equivalent) over the Internet 118. The web server 114 can make available books on-line, for viewing by accessing files containing the books in a database 116, etc. Websites make books available on-line, for limited viewing to see if a user desires to purchase the book. The website stores scanned versions of books in the database, and stores text files that allow for searching of the scanned versions of the book. The scanned and text files are locked and can be unlocked using the process 130 described in FIG. 7. Process 130 is generally stored in storage 119 on server 114. In some embodiments, process 130 communicates with client system 112 via a browser 113.

Referring to FIG. 7, a process 130 for unlocking digital content on a web site is shown. A user accesses 132 a website, and searches for a book that is stored electronically on the web site. It is presumed that the user has physical possession of a physical copy of the book. The user may want to search the physical book and accesses the web site to use a search service that the website provides for users who have physical possession of the book. The process 130 for unlocking digital content from the web site is typically initiated by the user, using a variety of techniques.

The web site receives 134 a message over the Internet requesting to open a file with locked digital content. This message can result from the user conducting a search, finding electronic content of interest and selecting a link, e.g., a hyperlink, to open the electronic content. The process selects 136 a test and/or a test type to send to the user, from the test type processes 47 a-c described above or process 47 d described below. The process generates 138 the test from the selected test type and sends 140 the user a web page or the like that includes the test and a field for the user to enter an answer to the test. The user answers the test and sends the answer back to the web site. The process 130 evaluates 142 the answer supplied by the user to see if the answer was correct. If the test was answered correctly, the process 130 unlocks 144 the digital content on the web site, e.g., by decrypting the file storing the digital content and so forth. The process 130 renders 146 the digital content in a form for use by the user or by a computer under the control of the user. If the user answered incorrectly, an error message can be returned 148.

Unlocked digital content can include a text file, a searchable, indexed text file, and special or enhanced features such as an animated presentation using a media player, e.g., a Flash media player from Macromedia, Inc. (600 Townsend Street, San Francisco), etc. Unlocking digital content can also include unlocking audio content on a web site. For example, the website can make an audio book available to users who have the physical book.

The printed, physical copy of the book is relatively expensive and time consuming to copy. Availability of the physical book is easier to control than the availability of digital content. Thus, using a hard to copy source, such as the printed copy of the book, to unlock easy to copy digital data, such as digital content on a CD or a website, permits the publisher of the media to control the digital content more effectively.

Unlike a license key that often accompanies software, the unlocking process 30 or 130 can change every time an attempt is made to access the digital content, in order to verify that the user still has the copy of the book. Typically, the license key is used to unlock software once, during installation of the software on a computer, and is, in general, the same each time the license key is used. Thus, making copies of the license key and the CD allows for easy, often illegal distribution of copies of the software. With the unlocking process, by contrast, the user must make an illegal copy of the entire physical book, which, while analogous to making a copy of the key, is much more expensive and much more involved than copying the key. Users would be less likely to copy the entire book and more inclined to buy the book.

Referring to FIG. 8, an alternative test type 47 d uses a machine readable code such as embedded in an RFID tag, bar code, or the like. RFID tags are generally affixed to merchandise for inventory tracking and the like. In general, each RFID tag has a different sequence number, code number, manufacturer's number, etc. RFID tags can be affixed to books. In this configuration, the RFID tag is configured at a minimum to identify the title of the book, and optionally a serial number for the book.

For client systems configured with RFID readers, the RFID tag unlocks the content by scanning 162 the tag with the RFID reader and comparing 164 the code read from the reader with a code retrieved 166 from a CD. For implementations using the Internet, a user would connect to the Internet and would be instructed to scan the RFID tag. The process would read the code from the tag and send the code to the website for verification to unlock extra content on the website, such as a Flash presentation, searchable file or index as discussed above.

Some libraries use RFID tags for inventory management. If not done so already, it would be relatively simple for book publishers, particularly school book publishers, to add RFID tags with unique codes to the books they print and distribute. Alternatively, in place of an RFID tag and RFID reader, a bar code and bar code reader could be used.

Referring to FIG. 9, a reading machine implementation 170 uses the digital unlocking process 30 or unlocking process 130 to unlock digital content. In the implementation 170, unlocked digital content is received 172 and the implementation determines 174 if the unlocked digital content is in the form of an image. If the unlocked digital content is in the form of an image, the unlocked digital content is operated 176 on by Optical Character Recognition software to produce text. Either a text file from the unlocked digital content, if the content is already in a text format, or the text produced by the OCR software is sent 178 to speech synthesizer software to render 180 audio aloud to the user. A display renders the digital content as text or image or other format, and highlighting is applied to the displayed representation in synchronization with the speech. The highlighting indicium could be bold, color, shading, italics, and so forth applied to the item as the item is read. The item could be a word, sentence, phrase, paragraph, etc.
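The reading machine flow of FIG. 9 can be pictured with the short sketch below. The OCR engine, speech synthesizer, and display are represented by caller-supplied functions (`ocr`, `speak`, `highlight`), which are placeholders rather than real library APIs; the sketch treats each whitespace-delimited word as the highlighted item and assumes `speak` blocks until the word has been spoken.

```python
import re
from typing import Callable, Iterator, Tuple, Union


def to_text(content: Union[str, bytes], ocr: Callable[[bytes], str]) -> str:
    """Use the content as text if it already is text; otherwise run OCR on the image."""
    return content if isinstance(content, str) else ocr(content)


def word_spans(text: str) -> Iterator[Tuple[int, int, str]]:
    """Yield (start, end, word) for each word so the display can locate it."""
    for match in re.finditer(r"\S+", text):
        yield match.start(), match.end(), match.group()


def read_aloud(text: str,
               speak: Callable[[str], None],
               highlight: Callable[[int, int], None]) -> None:
    """Highlight each word on the display just before it is spoken."""
    for start, end, word in word_spans(text):
        highlight(start, end)  # apply the highlighting indicium to the item
        speak(word)            # assumed to block until the word is spoken
```

Sentences, phrases, or paragraphs could serve as the item instead by changing the pattern used in `word_spans`.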

Other representations can be stored in the digital media, for example bitmapped, and so forth. In addition, the digital media can include more interactive or active features that are unlocked. For example, the physical copy of, e.g., a book could include an illustration or picture on a page, and in the digital media could be an applet or other type of executable code and data that could animate the illustration on a display, with visual and audio additions which are not in the physical copy of the book.

A number of embodiments of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. For example, while the unlocking software has been described as residing on the same media as the digital content, it need not be. It could be a program or circuit that is executed or operated on a user's computing machine. In addition, while the content has been described in the context of a book, other forms of physical content can be used such as magazines, newspapers, periodicals and so forth. Accordingly, other embodiments are within the scope of the claims.

What is claimed is:
 1. A computer implemented method for unlocking digital content embodied in digital readable form on a digital media carrier, the method comprises: storing by a computer, an association of a printed publication with digital content; storing, by the computer, an arrangement of text from the associated physical copy of the printed publication; selecting randomly by the computer a page number of the printed publication; producing by the computer a test that includes the randomly selected page number of the printed publication to scan; changing by the computer the randomly selected page number of the test with each attempt at unlocking the digital content; receiving by the computer a request to unlock the digital content; transmitting, by the computer, the test; receiving, by the computer, an image; determining by the computer whether the image corresponds to the randomly selected page of the printed publication, by: applying optical character recognition to the received image to produce data that represents an arrangement of text in the image; and comparing the produced data representing the arrangement of text to the stored version of the arrangement; determining whether the received image corresponds to the randomly selected page of the printed publication, and based on the determination that the received image corresponds to the randomly selected page, unlock the digital content.
 2. The method of claim 1 wherein the optical character recognition produces a text representation of the image and comparing compares the text representation to a stored version of the text.
 3. The method of claim 1 wherein a digital camera or a web cam is used to scan the page and evaluating comprises detecting a pattern of color in the data from the digital camera or web cam; and determining whether the image corresponds to the randomly selected page of the digital content.
 4. The method of claim 2 wherein a scanner is used to scan the page.
 5. The method of claim 1 wherein unlocking the digital content provides a text based searchable file.
 6. The method of claim 2 further comprises: generating by the computer a signature from the text representation of the image; comparing by the computer the generated signature to a stored signature.
 7. The method of claim 1 wherein the digital content is a book stored on a digital storage media and the physical copy of the digital media is a printed copy of the book.
 8. The method of claim 1 wherein unlocking the digital content provides text.
 9. The method of claim 8 further comprising: applying speech synthesis that converts the text from the digital content into speech that is read aloud to the user.
 10. The method of claim 9, further comprising: displaying a representation of a portion of the digital media on a display; applying a highlighting indicia to an item in the displayed portion of the printed content, in synchronization with reading the item to the user.
 11. The method of claim 8 wherein the digital content includes a book stored in a digital representation on the digital storage media and the printed publication of the digital media is a printed copy of the book.
 12. The method of claim 8 wherein displaying displays an image representation from the digital media, and the method further comprises: applying speech synthesis that converts the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the image representation in synchronization with reading the text.
 13. The method of claim 1, further comprising: displaying a representation of a portion of the digital media on a display; using speech synthesis that converts text corresponding to the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the displayed portion in synchronization with text corresponding to the item being read.
 14. A computer readable hardware storage device that tangibly stores a computer program product, the computer program product that unlocks digital content embodied in digital readable form on a digital media carrier, the computer program product comprising instructions when executed by the computer causing the computer to execute the instructions for: storing an association of a printed publication with digital content; storing an arrangement of text from the associated physical copy of the printed publication; randomly selecting a page number of the printed publication; producing a test that includes the randomly selected page number of the printed publication to scan; changing the randomly selected page number of the test with each attempt at unlocking the digital content; receiving a request to unlock the digital content; transmitting the test; receiving an image; determining whether the image corresponds to the randomly selected page of the printed publication; applying optical character recognition to the received image to produce data that represents an arrangement of text in the image; and comparing the produced data representing the arrangement of text to the stored version of the arrangement; determining whether the received image corresponds to the randomly selected page of the printed publication; and based on the determination that the received image corresponds to the randomly selected page, unlock the digital content.
 15. The computer readable hardware storage device of claim 14 wherein the optical character recognition produces a text representation of the image and instructions for comparing, compare the text representation to a stored version of the text to determine that the scanned image corresponds to the selected page of the digital content.
 16. The computer readable hardware storage device of claim 14 wherein optical character recognition produces data that represents an arrangement of features in the image and instructions for comparing, compare the data representing the arrangement of features to a stored version of the arrangement to determine that the image corresponds to the randomly selected page.
 17. The computer readable hardware storage device of claim 14 wherein the image is a scanned image and a digital camera or a web cam is used to scan the page and further comprises instructions for: detecting a pattern of color in the data from the digital camera or web cam; and determining whether the scanned image corresponds to the randomly selected page.
 18. The computer readable hardware storage device of claim 14 wherein the unlocked digital content is a text based searchable file.
 19. The computer readable hardware storage device of claim 15 further comprises instructions for: generating a signature that is compared by the compare instructions to a signature generated from the digital media; and determining whether the user had the physical copy of the content.
 20. The computer readable hardware storage device of claim 14 wherein the unlocked digital content is text.
 21. The computer readable hardware storage device of claim 20 further comprising instructions for: applying speech synthesis that converts the text from the digital content into speech that is read aloud to the user.
 22. The computer readable hardware storage device of claim 20 further comprising instructions for: displaying a representation of a portion of the printed copy of the digital content on a display; applying a highlighting indicia to an item in the displayed portion of the printed content, in synchronization with reading the item to the user.
 23. The computer readable hardware storage device of claim 14 wherein the digital content includes a book stored in a digital representation on the digital storage media and the physical copy of the digital media is a printed copy of a book.
 24. The computer readable hardware storage device of claim 22 wherein a display displays an image representation from the digital media, and the computer program product further comprises instructions for: applying speech synthesis that converts the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the image representation in synchronization with reading the text.
 25. The computer program product of claim 14, further comprising instructions for: displaying a representation of a portion of the digital media on a display; applying speech synthesis that converts text corresponding to the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the displayed portion in synchronization with text corresponding to the item being read.
 26. An apparatus comprising: a computing processing device; and a computer readable hardware storage device storing a computer program product for unlocking digital content embodied in digital readable form on a digital media carrier, the computer program product comprises instructions when executed by the computing device causing the computing device to execute the instructions for: storing an association of a printed publication with digital content; storing an arrangement of text from the associated physical copy of the printed publication; randomly selecting a page number of the printed publication; producing a test that includes the randomly selected page number of the printed publication to scan; changing the randomly selected page number of the test with each attempt at unlocking the digital content; receiving a request to unlock the digital content; transmitting the test; receiving an image; determining whether the image corresponds to the randomly selected page of the printed publication; applying optical character recognition to the received image to produce data that represents an arrangement of text in the received image; and comparing the produced data representing the arrangement of text to the stored version of the arrangement; determining whether the received image corresponds to the randomly selected page and based on the determination that the received image corresponds to the selected page unlock the digital content.
 27. The apparatus of claim 26 wherein the optical character recognition produces a text representation of the scanned image and instructions for comparing, compare the text representation to a stored version of the text to determine that the scanned image corresponds to the selected page of the digital content.
 28. The apparatus of claim 26 wherein the optical character recognition produces data that represents an arrangement of features in the scanned image and instructions for comparing, compare the data representing the arrangement of features to a stored version of the arrangement to determine that the received image corresponds to the selected page of the digital content.
 29. The apparatus of claim 26, wherein the image is a scanned image and the apparatus further comprises: a digital camera or a web cam to produce the scanned image of the page and further comprises instructions for: detecting a pattern of color in the data from the digital camera or web cam; and determining whether the scanned image corresponds to the selected page of the digital content.
 30. The apparatus of claim 26 wherein the unlocked digital content is a text based searchable file.
 31. The apparatus of claim 26 wherein the computer program product further comprises instructions for: applying speech synthesis that converts the text from the digital content into speech that is read aloud to the user.
 32. The apparatus of claim 26 wherein the computer program product further comprises instructions for: displaying a representation of a portion of the printed copy of the digital media on a display; applying a highlighting indicia to an item in the displayed portion of the representation of the printed copy, in synchronization with reading the item to the user.
 33. The apparatus of claim 26, wherein displaying, displays an image representation from the digital media, and the computer program product further comprises instructions for: applying speech synthesis that converts the digital media into speech that is read aloud to the user, while applying a highlighting indicia to an item in the image representation in synchronization with reading the text. 